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The impressive success of the standard cosmological model has suggested to many that its ingre- 
dients are all one needs to explain galaxies and their systems. I summarize a number of known 
problems with this program. They might signal the failure of standard gravity theory on galaxy 
scales. The requisite hints as to the alternative gravity theory may lie with the MOND paradigm 
which has proved an effective summary of galaxy phenomenology. A simple nonlinear modified 
gravity theory does justice to MOND at the nonrelativistic level, but cannot be consistently pro- 
moted to relativistic status. The obstacles were first sidestepped with the formulation of TeVe S, 
a covariant modified gravity theory. I review its structure, its MOND and Newtonian limits, and 
its performance in face of galaxy phenomenology. I also summarize features of TeVe S cosmology 
and describe the confrontation with data from strong and weak gravitational lensing. 
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1. Prom standard cosmology to small scale structure 

The agreement between the measured light elements cosmic abundances and predictions by the 
theory of big bang nucleosynthesis and that between the measured spectrum of fluctuations in 
the cosmic microwave background (CMB) and predictions from the general relativistic theory 
of growth of cosmological perturbations have convinced many that the standard concordance 
cosmological model must be near the mark. To what extent are the expectations from this model 
for the issue of small scale structure confirmed? 

For my purposes here small scale structure refers to systems ranging from the scale of large 
clusters of galaxies down to those of dwarf galaxies. According to the standard model, all these 
originated from initially small fluctuations in the dark matter (DM) density distribution which 
grew prior to recombination. As the baryonic plasma became neutralized it began to fall into 
gravitational potential wells generated by the earlier collapsing DM. This was to give rise to all 
the small structure we see around us, each luminous galaxy embedded in a heavier halo of DM. 

In fact, well before the standard cosmological model was in place, Ostriker & Peebles [1] 
realized that in Newtonian gravity a pure disk galaxy supported solely by rotation is unstable to 
the formation of a bar. In view of the relative paucity of barred galaxies, they proposed that the 
typical disk galaxy nestles in the gravitational potential of a stabilizing massive encompassing 
invisible halo supported by random motion. This scenario was applied to explain the fact, just 
then becoming established, that rotation curves (RCs) of disk galaxies are quite flat outside 
the central regions. A flat rotation curve follows if a halo with r~ 2 density profile dominates 
the gravitational field outside the central regions. Modern dark halo modelling, inspired by the 
Navarro-Frenk- White DM halo density profile (NFW) [2] from cosmological simulations, has been 
widely acclaimed as explaining the shapes of the RCs. 

In the typical RC the passage from the central rising stage (dominated by the baryonic 
disk) to the flat part (dominated by the halo) is featureless. Bahcall & Cassertano [3] realized 
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that this requires the parameters of disk and halo to be fine tuned together. A more dramatic 
example of fine tuning is posed by the Tully-Fishcr (TF) relation for disks. In its modern form by 
McGaugh [4] the baryonic mass in the galaxy is accurately proportional to the fourth power of 
the rotation velocity (in the flat part), with a proportionality coefficient independent of galaxy 
type. Since the rotation velocity is dominated by the DM halo, the halo parameters must be 
fine tuned to the mass of the baryonic disk. But fine-tuning begs the question; can DM halo 
modelling get us out of it? 

The NFW halo's mass density profile starts like r _1 near the centre, and then steepens to 
r~ 3 in the outer parts. Only in the intermediate region is it like the r~ 2 profile that gives a 
flat RC; and indeed the models based on NFW do not well reproduce the observed flatness of 
RCs well beyond the optical edge of the galaxies. Further, in NFW halos the DM mass scales 
as the velocity dispersion cubed, and virial arguments tie the RC amplitude to the DM velocity 
dispersion. How to get the Mat) 4 TF law? It has been a pious hope of galaxy DM proponents 
that in the course of infall of baryonic gas into a DM halo potential well, gas dynamical processes 
will fine tune disk and halo together to give the TF relation. This fine tuning should also — so the 
hope — erase any incipient feature at the joint between the disk dominated rising part of the RC 
and the halo dominated flat part. But the TF correlation is among the tightest in the whole of 
astrophysics; in fact, it is about as narrow as allowed by the observational errors. As Bob Sanders 
emphasized to me, turbulent gas dynamics would naturally lead to a spread in the amount of 
gas mass falling into each of a set of identical halo potential wells. This will sabotage the process 
attempting to fine tune, even if it exists. 

The central r _1 mass density profile of the NFW halo means there is a mass cusp at the 
centre. In low surface brightness galaxies (LSBs) the baryon density (mostly gaseous) is so low 
that the halo should dominate dynamics everywhere. But careful study of extant RCs by Chen 
& McGaugh [5] does not disclose the fast motions near the centres that would be expected in 
the presence of cusps. Again, one must rely on the hope that gas dynamics feedback on the halo 
have flattened the cusps. 

Cosmological simulations predict that NFW halos form in a hierarchy, with small halos arising 
together with each big one. If each halo eventually cuddles a galaxy, with galaxy baryonic mass 
increasing monotonically with halo mass, there end up being many more small satellite galaxies 
per big galaxy than observed in our neighbourhood (galaxy surveys are now showing this is 
also true elsewhere). This embarrassment could again be resolved if gas dynamics is such that 
the small halos remain unpopulated by baryons and thus invisible. Such eventuality could be 
checked with gravitational lensing which is also sensitive to sterile halos. (Claims of detection of 
the corresponding small scale substructure around big galaxies have not been confirmed.) 

2. The MOND paradigm 

Thus, while the standard cosmological model may be convincing on the grounds of its predictive 
successes, as we follow its logical implications towards smaller scales, we meet with embarrass- 
ments. Of course all these could be the product of astrophysical complications not yet accounted 
for. But, logically, the possibility cannot be excluded that standard gravity theory, which is 
implicitly assumed in all the above deductions, is failing on small scales. 

As the ubiquity of the flat RCc and the wide applicability of the TF luminosity vs rotational 
velocity were being internalized, Milgrom [6] realized that the naive Newtonian description of a 
galaxy containing only baryonic matter fails in a way that correlates with the system's typical 
acceleration (in units of a certain scale ao), rather than with its length or mass scales. (There 
is a beautiful pair of graphs in McGaugh & Sanders' comprehensive review [7] that illustrates 
this with modern data.) While Milgrom was not the first to suggest a departure from Newtonian 
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dynamics as the resolution of the above problems, his phenomenological scheme — MOND — has 
been by far the most fruitful in this respect. 

MOND posits [6,8] that in astronomical systems the acceleration a is related to the Newtonian 
gravity field gpf sourced by baryonic matter by 

/i(|a|/ao)a = gjv. (2.1) 

Here jl is a function of positive argument interpolating between at low argument and 1 at large 
argument (both compared to unity), for example, the "simple" interpolation function 

fi{x) = (2.2) 
l + x 



Applied to test particles in an ambient gravitational field, equation (|2.ip can be used to predict 
rotation curves from the observed baryonic mass distribution. It is easily seen that this relation 
forces the RC to become flat in a disk's outskirts, where a < ao (postdiction of observed sys- 
tematics) and also requires the baryonic total mass M (source of g N ) to scale as v\ where v c is 
the rotation velocity in the flat part of the RC (prediction of the modern form of the TF law) . 
Other early predictions of MOND in this simple form are [7,8]: 

• A LSB (defined as a disk whose surface mass density is everywhere below Ga$) should 
have an especially large mass discrepancy (ratio of dynamically inferred mass to visible 
baryonic mass). When years later the first LSB RCs were measured, this prediction was 
corroborated. 

• A LSB's RC must always rise, in contrast to a higher surface density disk whose RC rapidly 
peaks and then descends to a plateau. The predicted detailed shape of the LSB's RC is 
independent of the choice of jl function. The rising behaviour of the RC is well documented 
today, and MOND predictions of LSB RC shapes are especially successful. 

• As a rule globular clusters must be well described Newtonially in terms of visible stellar 
mass because their internal Newtonian fields exceed ao- This is now well established; yet 
at the time of the prediction it was unexpected that globulars should have no DM halos 
(even today it is not clear why they do not). 

• Dwarf spheroidal galaxies (sparse assemblies supported by random stellar motion) should 
have big mass discrepancies because they all have surface mass densities well below Gag- 
The large dynamical mass-to-light ratios of dwarf ellipticals are well documented today. 

• MOND can predict the presence of features in the RC of a disk with normal surface mass 
density in terms of the disk's detailed baryonic mass density profile, provided one uses the 
"correct" jl. The trend today is to use equation (|2.2[) . Over 100 fairly well measured RCs of 
such galaxies have been fit by MOND, and as a rule these fits are as good as any DM halo 
fit. In contrast to the classic one-parametric NFW halo family, models used in DM halo 
fits [9] have two parameters, the halo's core radius (to excise the embarrassing cusp), and 
its central velocity dispersion. To these is added the mass-to-light ratio Y* of the galaxy's 
stellar component. MOND is more parsimonious, requiring only one parameter, T*; yet it 
gives fits as good or better than do the DM models. 

Nobody is prefect, and MOND is no exception. While it gives a better account of the velocity 
dispersions in clusters of galaxies than does Newtonian theory without DM, it still requires the 
addition of unseen material [7] . Ironically, even this failure led to a good early prediction; Milgrom 
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suggested that the problem results from the existence, in all clusters, of large quantities of gas 
which had not yet been seen [10]. In the mid 1980's such X-ray emitting gas was indeed found 
in clusters, to the tune of five times the mass in galaxies or so. While this finding tends to help, 
the gas is not enough to wholly vindicate MOND, which still must rely on an amount of invisible 
matter comparable to that of the gas mass. 

Another partial success of MOND has to do with giant and dwarf ellipticals. Observationally 
these fall on a "fundamental plane" FP in the space of central velocity dispersion, effective radius 
and mean surface brightness. The locus predicted by Newtonian theory for virially equilibrated 
galaxies with constant Y* is tilted with respect to the observed FP. This can be fixed by appealing 
to a DM halo as well as strong velocity anisotropy. According to a recent detailed study by 
Cardone et al. MOND does better but leaves some tilt too [11]. Possibly taking into account 
the external field effect (sec of the nearby galaxies (ellipticals are mostly found in densely 
populated regions) will mend the problem. 

Mild failures aside, it is clear that there is a broad range of masses, 10 6 — 1O 11 M , in which 
systems adhere to MOND in their systematics. This must be telling us something; logically there 
are the following possibilities, a) MOND is merely an efficient summary of the way DM is dis- 
tributed in the said systems? b) MOND reveals the dependence of inertia on acceleration for 
small accelerations? c) MOND betrays hitherto unknown forces particularly effective at astro- 
nomical scales? d) MOND encapsulates departures from standard Newtonian-Einsteinian gravity 
theory at the mentioned scales? 

Possibility a) has to contend with the fact, mentioned above, that predictions from dynamical 
simulation for DM halos (NFW) run into observational problems (e.g. cusps, paucity of satellites). 
The hope that backreaction of the gas dynamics on the halos will resolve all problems seems to 
me to be a wishful hope. The best theory for option b) is a nonlocal theory of inertia which 
is extremely hard to calculate with [12]. One is stymied in this direction. Option c) alludes to 
the "fifth force" paradigm. Were the fifth force weaker than Newtonian gravity by an order of 
magnitude, it would become irrelevant for our galactic scale problem. Were it stronger by an 
order of magnitude, it would swamp gravity and contravene the facts. Such fifth force must 
couple only to mass, just as does gravity, in order to preserve the universality of free fall which 
we seem to see working in galactic orbits. Thus we may just as well think of this force as a facet 
of gravity. Possibility d) is my topic here. My talk is an update of several reviews, which should 
be consulted for details and a fuller list of references [13-15]. 

3. MOND as gravity 

Formula (|2.1j) is good for test particles in an external field. Applied to a composite system, e.g. a 
binary galaxy, it entails nonconservation of momentum, energy and angular momentum (because 
the inertia of each particle can depend on its acceleration). A consistent embodiment of the 
MOND paradigm as a gravity theory must be based on a Lagrangian to enforce the conservation 
laws. In fact, if in the usual Newtonian Lagrangian density we make the replacement 

|V^| 2 =>^F(|V*| 2 /a§) (3-1) 

will do the trick; here V$tv = — gw while $ is the true gravitational potential (a = — V$) 
that couples to mass density p just as $at does in the Newtonian theory. The Euler-Lagrange 
procedure leads to the modified Poisson equation [16]: 

V • [/2(|V$|/a )V$] = SnGp; p,(y/x) = dF(x)/dx. (3.2) 

Here jl(x) is identified with that in equation (j2.1[) . This theory is called AQUAL (for AQUAdratic 
Lagrangian); it respects the usual conservation laws. 
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For spherical, cylindrical and planar geometry, the solution of equation (|3.2[) together with 
a = — V<I> gives formula (I2.1[) : AQUAL is a decent nonrelativistic theory embodying the MOND 
paradigm. For lower symmetry the corrections are important, and have been studied somewhat 
with numerical codes. Among the additional insights issuing from AQUAL I mention 

• AQUAL explains why an internally high acceleration object, e.g. a star, orbits in a galaxy 
according to the MOND dynamics that pertain to the low acceleration of the orbit of its 
centre of mass [16]. 

• AQUAL confirms the suspected existence of the external field effect: an internally weak 
acceleration object, e.g. an open cluster, subject to an external held > ao has nearly 
Newtonian dynamics [6,16]. 

• AQUAL allows study of dynamical evolution of multiple systems, e.g. binary galaxy [17]. 

• AQUAL gives a systematic account of regularities in pressure supported systems, e.g. early 
type galaxies [18]. 

These successes support the notion that MOND encapsulates some deviation of gravitation 
physics from the standard Newton-Einstein paradigm. Of course one would also like a rclativistic 
version of the theory to confront data from gravitational lenses, cosmology, etc. In AQUAL we 
managed to build the Lagrangian because the fundamental quantity V<& has the meaning and 
dimension of the acceleration held, and can thus be compared with ao to delineate the MOND 
dominant region. In the geometric (Einstein) formulation of gravity theory, the analog of V$ is 
the connection or Christoffel symbol associated with the metric. But this last is not a tensor, 
and so cannot be used undifferentiated to form a covariant Lagrangian. Worse, by judicious choice 
of coordinates one can make = locally, but obviously the MOND regime should not be 
delineated by the choice of coordinates! We really need a different acceleration held. 

The simplest thing to do is to replace AQUAL 's $ by a relativistic scalar held ip bear- 
ing the same dimensions, those of c 2 . We keep the metric g a p of general relativity (GR) gov- 
erned by the usual action. But we break with GR by requiring that matter "live" in the metric 
9a/3 — exp(2ip I c 2 ) g a p conformally related to g a p. The Lagrangian for ip is a covariant version of 
that for $ in AQUAL; the corresponding equation is a wave equation version of equation (|3.2p 
with p replaced by minus the trace of the energy-momentum tensor of the matter, T. All this 
characterizes R(elativistic) AQUAL [16]. Analysis of nonrelativistic motion identifies the poten- 
tial $ with $jv+V'- For quasistatic nonrelativistic systems one can ignore time derivatives as well 
as replace T by — p; then the equation determining ip is essentially equation (|3.2[) . Thus in the 
regime where |V-0|c~ 2 /ao <C 1, |V-0| 3> |V$at| and one can take $ w ip and so recover AQUAL. 
RAQUAL thus inherits all good properties of AQUAL and reproduces MOND. But two serious 
dehciencies of RAQUAL were apparent early [16]. 

• Small perturbations of ip about a background solution propagate superluminally in regions 
with low acceleration. 

• The gravitational deflection of light in RAQUAL is about the same as that in GR for the 
same mass. 

Superluminality raises the spectre of acausal behaviour; it is worrying, but perhaps not dis- 
abling since it does not run in the face of any known observations. However, already by the early 
1980's it was clear that in clusters of galaxies which happen to gravitationally lense quasars, the 
deflection of light is much larger than anticipated by GR models with no DM included. Thus 
RAQUAL, which as a matter of principle eschews DM, runs counter to the facts. Wherein lies 
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the problem? First, the energy momentum tensor of tp itself is quite negligible compared to that 
of the matter, so the metric g a p is about the same as in GR. Second, the metric g a p, to which 
electromagnetic fields couple, is conformal to g a p, and it is well known that conformally related 
metrics are totally equivalent with respect to electrodynamics. The solution to the problem is 
thus obvious: make the passage from g a p to g a p nonconformal. For example, the two metrics 
could be related disformally, g a p = exp(2^/c 2 )(<7 Q/ 3 +w^ >a ip > p), with w a function of the invari- 
ant norm of ip,a [19]. The requirements that g a p possess an inverse, that the null cone of g a p 
not lie outside that of g a /3 (no superluminal propagation of gravitons) and that ip bear positive 
energy density force w < 0. Calculation then shows that light deflection is actually a little weaker 
in the new theory than in RAQUAL [20] . So purely scalar-tensor scalar theories fail to cope with 
the facts of life. 

What the scalar field cannot accomplish alone, it may accomplish with help from a vector 
field. Sanders proposed the disformal relation (from now on I set c = 1) 

g al 3 = e~ 2<t> {g aP +U a U p ) - fMujAp, (3-3) 

where is a new scalar field and U a is a constant unit vector field directed along the time 
flow. With AQUAL dynamics for Sanders' "statified" theory not only recovers the MOND 
phenomenology, but also give lensing of the right strength [21]. Of course, a constant vector field 
runs counter to the spirit of covariance, so the theory is still not a consistent relativistic MOND. 

For years the repeated failures to produce such a consistent theory had fuelled opposition to 
MOND by DM pundits. The introduction of TeVe S [22] ended that phase of the confrontation 
by exhibiting an example of a consistent relativistic implementation of MOND. Today there are 
already quite a few gravity theories which claim to make the MOND paradigm relativistic. Let 
me focus on TeVe S in a slightly more sophisticated form than it took at its inception. 

4. TeVe S, its structure and limits 

Like RAQUAL or Sanders stratified theory, TeVe S has two metrics, the Einstein metric g a p and 
the physical metric g a p, related via equation (|3.3[) . but with U a now regarded as dynamical. The 
gravitational action (written exclusively with g a p, which also serves to raise indices within it) is 
the sum of three parts. First, the usual Einstein-Hilbert action. Second, the scalar's action 

S s = -^^jF(keh^<t>, a <t>,p){-g) 1 ' 2 d l x ] h a < i =9'#-g a *(f v U l M V) (4.1) 

wherein J 7 is a positive function, A; is a dimensionless coupling constant, and £ is a constant scale 
of length. And thirdly the vector's action 

S v = -32^ J [Kg^g^U M U [M + K{g^U a ^f - 2X(g^U t M v + 1)] (-g) 1/2 d 4 x, (4.2) 

where the square bracket denotes antisymmetrisation. The vector's action sports a pair of dimen- 
sionless coupling constants, K and K and a Lagrange multiplier field A charged with enforcing 
normalization of U a to -1. Originally [22] S v comprised only the K term. Contaldi et al. [23] 
criticized TeVe S for the tendency of U a to form caustics, and suggested addition of the K term 
as a remedy; this was first used by Skordis [24] and Sagi [25]. Actually there are two more terms 
that are quadratic in derivatives of U a and have also been invoked (the set of four terms is often 
referred to as Einstein- Aether dynamics [26]). For simplicity I leave the last two terms out. 

Add now the standard action for matter and nongravitational radiation. It is written exclu- 
sively with g a p in order to comply with the weak equivalence principle, and to set TeVe S apart 
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from standard GR. In practice the consequent energy-momentum tensor T a p is taken as a sum 
of perfect fluid contributions, one for dust-like matter and one for radiation. 

Zlosnik, et al. [27] have shown how to reformulate TeVe S solely in terms of the metric g a p 
(which turns out to have an Einstein- Hilbert action), and the vector field; the <f> is eliminated with 
the help of the unit vector constraint. In this form TeVe S might figuratively be described as GR 
with a DM (the vector field). This form of the theory is useful for clarifying certain conceptual 
issues, but because of the intricate form of the vector action, it is much harder to calculate with. 
So I continue to discuss TeVe S in its bimetric form. 

In the limit K — > 0, K — > and I — > oo with k ~ £~ 2 / 3 , TeVe S tends to GR for quasistatic 
systems and homogeneous cosmology [22]. It is likely that it does so for more general situations. 
It is thus advisable to keep K 1 K and k small compared to unity in order not to stray too far from 
GR which, after all, is well tested in the solar system. The above does not mean that TeVe S at 
large accelerations automatically becomes GR. 

To define "large accelerations" one must commit to a particular T: every choice of T defines 
a separate TeVe S theory. I suggested a toy T in my original paper [22] . Alternatives have been 
suggested by Sanders [28] and by Angus, et al. [29], the latter subsuming as special cases my toy 
T and that corresponding to the simple ft, equation (12.21) . One then derives the scalar equation 

[n(k£ 2 h"< 5 4, 7 £ s )h a ^, a ]. p =kG [g af3 + (1 + e" 4 ^) U a U^\ f a0 ; fx(y) = F\y). (4.3) 

Recalling that in the nonrelativistic limit the trace g a ^T a p tends to — p, while U a U^T a p stands 
for energy density measured by an observer with 4- velocity U a , it is clear that in the quasistatic 
nonrelativistic regime this last equation is very much like equation (|3.2[) . 

Now by linearizing the relation (|3.3[) using a first approximation to U a , one concludes that 
$ sa $jv + (f>- Thus, at least for quasispherical situations, TeVeS reduces to AQUAL with 

ft fa (1 + k/inn)- 1 . (4.4) 

The T must be chosen so that this ft becomes linear in \V&\ at small gradients (small accelera- 
tions). For example, with my toy T Milgrom's constant comes out as 

a = Vsk/Aire. (4.5) 

For weak fields (|$| <C 1) and large accelerations ( | V<E > | ^> ao) the ft must approach unity 
so that the theory's predictions tend to those of Newtonian gravity. This is indeed the case for 
my toy T and for Sanders' choice of /i. Case a. = 1 of Angus, et al.'s interpolating n function 
fails: it gives a vanishing effective Newton constant. For a good T the Newtonian limit is gotten 
by taking I — > oo in the weak field regime; one does not have to take K — > and K — » 
which would be required to get GR. One may thus envisage tests of TeVe S vs GR in strong 
gravity. For example, Lasky, Giannios and Sotani in various combinations [30,31] have considered 
the differences obtaining in neutron stars and black holes between GR and TeVe S, the latter 
sometimes generalized to include in the action all four terms quadratic in U i0l . 

For weak fields and large accelerations one has to distinguish between the Newtonian and 
post-Newtonian behaviours. The /3 and 7 parametrized post-Newtonian (PPN) coefficients of 
even the simplest TeVeS (K = 0) agree with those of GR [22,31]. The four £ "conservation 
coefficients" , Whitehead's £ and the preferred frame coefficient of generalized TeVe S all 
vanish (in harmony with GR) with no special demands on the theory's parameters. Finally, 
the preferred frame coefficients ai,a!2, though nonvanishing, can be made to comply with the 
stringent solar system experimental constraints in appropriate ranges of the four K and single 
k coupling coefficients of fully generalized TeVe S [25] . TeVe S thus squeaks through the PPN 
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filter; however, its recovery of the Newtonian limit leaves much to be desired. It is true that as 
|V$|/ao grows well beyond unity, V<£> — » V<&at, but the approach is not as fast as one would 
like. Consequently the Kepler constant a 3 P~ 2 is not exactly constant from planet to planet 
as would be required by Kepler's third law. This prediction may already be in conflict with 
modern ephemerides, and the problem may not be solvable within the general framework of 
tensor- vector-scalar type theories [28]. 

Thus TeVe S (and theories like it) may not be the final word. But how does the theory fare 
in light of galaxy phenomenology, gravitational lensing and cosmological data? On the answers 
hinges the decision whether one needs to look elsewhere for a panacea alternative to DM. 

5. Gravitational lensing and cosmology in TeVeS 

In the nonrelativistic regime TeVe S gives back AQUAL/MOND O- This is shown rigorously 
for spherical systems [22], but apparently also goes through in the presence of considerable 
asymmetry [29]. Turning to relativistic issues, recall that in RAQUAL (with either conformal or 
disformal relations between the two metrics) , the deflection of light is hardly different from that 
which GR would predict with no DM — in conflict with observations. By contrast in TeVe S the 
(single) potential responsible for weak field light deflection is the same as that which appears in 
AQUAL/MOND [22]. Since this last describes the dynamics of a galaxy as well as an optimal 
DM halo model (it produces the same potential: 0, its predictions for lensing will thus be 
comparable to those of GR with an optimal distribution of DM. 

However, this blanket statement has to be qualified. AQUAL/MOND with the customary ft 
predicts that a galaxy's RC must remain flat to arbitrary large radius; the logarithmic potential 
<J> that accomplishes this will induce light bending of the form caused by an isothermal sphere 
in standard gravity theory. By contrast, the limited extent of DM halos should cause RC's to 
droop at large distances, and force the light bending at large impact parameter to approach that 
of a point mass. The Keplerian descent of the RCs is not seen. But since gravitational lensing 
can probe a galaxy's potential to larger radii than can radio and optical Doppler measurements, 
do gravitational lenses resolve the above confrontation? 

Microlensing in our galaxy aside, gravitational lenses and their objects lie at cosmological 
distances. Thus theoretical predictions for lensing a la TeVe S must use a genuine TeVe S cos- 
mological model for consistency. This was first done by Chiu et al. [32] and Zhao et al. [33]. In 
line with the early finding that TeVe S isotropic models with baryonic matter content alone re- 
semble the corresponding GR models because the scalar 0's energy density stays small [22], these 
workers find the former to provide just as consistent a scaffolding for the analysis of cosmological 
gravitational lenses as do isotropic GR models. 

Chiu et al. [32] derived the TeVe S lensing equation and applied it to strong lensing (multiple 
images). Most gravitational lenses are giant elliptical galaxies. Chiu et al. modelled the baryonic 
mass distribution of such a galaxy by a point; an Hcrnquist mass model is probably better. 
Zhao et al. employed both in their comparison of TeVe S's predictions with a sample of quasars 
doubly imaged by intervening galaxies from the CASTLES catalogue. They estimated the lens 
mass in each case by comparing predicted and observed image positions or amplification ratios; 
the two methods gave consistent results. The derived mass-to-light ratios fall in the normal range 
for stellar populations, with some outliers. Shan et al. [34], who took departures from spherical 
symmetry into account, also found agreement with TeVe S for those in their sample of 15 doubly 
imagined quasars with lenses not situated in clusters (to exclude the external field effect). 

The opposite conclusion was urged in a trilogy of papers by Sakellariadou and coworkers [35] . 
They concluded that TeVe S will not work for galaxy lenses in the CASTLES catalogue without 
help from invisible matter. In the first paper Ferreras et al. actually calculated lensing using a 
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mixture of MOND and GR instead of TeVeS, a procedure known to be misleading [15, 32]. In 
the second attempt, Mavromatos ct al. reiterate the above claim based on comparison of their 
TeVe S mass estimates for 18 lenses in CASTLES with the masses inferred from luminosity. But 
recently Chiu et al. [36], who carefully repeated the comparison for 10 of the lenses in question 
which can be modelled as spherical, reached the opposite conclusion that with the ft deriving 
from the toy T or the simple ft, TeVe S works adequately without requiring invisible mass. They 
impute the conclusion of Sakellariadou and coworkers to improper comparison of the inferred 
lensing mass with the stellar mass implied by the optical intensity through a fixed aperture. 

The distribution by angular separation of the two images in lensed quasars has proved trou- 
blesome for the DM paradigm. In TeVeS it has been investigated by Chen & Zhao [37] and 
lately by Chen [38]. Again modelling the mostly elliptical galaxies with Hcrnquist profiles, and 
describing their space distribution with the Fontana function, these workers compare predictions 
of, first TeVeS for a purely baryonic universe with cosmological constant, and second GR with 
DM and baryons, with the observed statistics in the CLASS/ JVAS quasar survey. They report 
that TeVeS comes out on top. The above is for spherical mass models of the galaxies; a step 
towards the modelling of asymmetric lenses within TeVe S has been taken by Shan et al. [34] . 

When the colliding clusters or "bullet cluster" system 1E0657-56 was discovered [39], it was 
proclaimed as a "proof of dark matter" . Weak lensing mapping (of the distortions of background 
galaxies) seems to show the main gravity sources to lie with the galaxies, rather than in the 
ejected hot gas which dominates the visible baryonic mass. Collisionless DM would indeed be 
expected to get separated from the dissipational gas and go along with the galaxies; hence the 
GR based conclusion that much DM exists in 1E0657-56. Angus et al. [40] analysed 1E0657-56 
within TeVe S and concluded that Sanders' suggestion [41] that clusters contain, along with the 
baryons, much mass in few-eV neutrinos might just save the day. Confirming that the source of 
gravity in 1E0657-56 must include an invisible component, Feix et al. [42] assert that nonlinearity 
of the AQUAL equation cannot prevent the lensing from tracking the baryonic matter (just the 
opposite is seen). But as Ferreira & Starkman remark [43], the absence in TeVeS of a Birkhoff- 
type theorem allows nearby structures, e.g. nearby clusters, to influence lE0657-56's gravitational 
field. This effect has not been factored into any of the above arguments against TeVeS. 

It is not the asymmetry of the "bullet" system which is the reason for TeVe S's woes. Taka- 
hashi & Chiba [44] analysed observed weak lensing by three quasispherical clusters. For each 
they obtained the gravitational field with TeVeS using several acceptable ft functions and a 
model mass profile inferred from a large cluster sample from the SDSS survey. Their predicted 
shear and convergence of the lensed light failed to fit the observations unless they added an un- 
rcalistically large mass in neutrinos. Similar conclusions are reached by Natarajan & Zhao [45]. 
In retrospect it is no wonder that TeVe S has problems with cluster lensing. After all, MOND 
dynamical models of clusters do not clear all the missing mass problem, and TeVe S is built on 
a MOND foundation. 

Obviously TeVe S's performance with clusters is weaker than that with galaxies. How does it 
do as we go to the domain of large scale? Turn to cosmology. The Friedmann models of GR can 
be transplanted to TeVe S [22] . One can ask whether G and ao are really constants in TeVe S 
cosmology. Whereas scalar tensor theories, e.g. Brans-Dicke's, generically have a variable effective 
gravitational constant, G is strictly constant in TeVeS isotropic cosmology [46]. By contrast, ao 
varies, but does so on a timescale much longer than Hubble's [46], at variance with some early 
speculations on MOND [6]. Since ao can be determined from a disk galaxy's RC, it should be 
possible to check this consequence of TeVe S with RC's at z <~ 2 — 5; these RC's will be available 
within years. 

Friedmann models are overly idealized. In his exemplary paper Skordis [47] worked out the 
full covariant formalism for growth of cosmological perturbations in TeVeS. Skordis et al. [48] 
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then used it to show that TeVe S's predictions can be made consistent with the power spectrum 
of the galaxy distribution and that of the CMB fluctuations, if one allows for a contribution of 
massive neutrinos to the tune of Q, v = 0.17, as well as cosmological constant, but no standard 
DM. The role of DM is here taken over by a feedback mechanism originating in the scalar field [47] 
(or in the vector field according to a variant opinion [49]). Thus power spectra do not cleanly 
distinguish TeVe S from GR. 

Dodelson and associates [50,51] pointed out that different modified gravity theories lead to 
relations between the matter overdensity and the local depth of the cosmological gravitational 
potentials, relations different from the standard GR ones. These are reflected in different inte- 
grated Sachs- Wolfe signals for each theory which can be brought out by examining the correlation 
between CMB temperature fluctuations and galaxy number overdensity [50]. Alternatively, the 
different relations arc reflected in different correlations between gravitational lensing convergence 
(and shear) and galaxy number overdensity [51]. Both of the above procedures are confused by 
biasing (the unknown ratio of matter to galaxies clustering) . Zhang et al. have proposed an esti- 
mator, Eg, which should make it possible to factor out biasing while correlating fluctuations in 
galaxy number overdensity with lensing [51]. They conclude that planned surveys in the optical 
by the LSST, or in the radio by the SKA, will reach the precision requisite to distinguish TeVeS 
from GR. 

Recently Reyes, et al. [52] estimated Eg from a sample of 7 x 10 4 giant ellipticals with a 
mean z — 0.32, and claimed a statistically significant difference between the predicted Eg for a 
TeVe S' model and the measured one, whereas GR's Eg fit well. The TeVe S cosmological model 
in question [51] comes from TeVeS with only the K term in vector action (14.21) . the version 
known to develop caustics [23] and not to fit all the measured PPN coefficients [25]. And the 
cited prediction for Eq is actually for a redshift interval larger than the observed one (TeVe S's 
and GR's predictions for Eg converge at low redshift [51]). Finally Reyes et al.'s error bars are 
large; a much larger sample is required for a statistically significant discrimination between the 
theories [51]. The claim of Reyes et al. paper's title to have established GR is surely premature. 

I thank Joao Magueijo for useful remarks, and the organizers for the invitation. 
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